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Abstr act. A pair of lower and upper cumulative distribution functions, also called probability 
box or p-box, is among the most popular models used in imprecise probability theory. They 
arise naturally in expert elicitation, for instance in cases where bounds are specified on the 
quantiles of a random variable, or when quantiles are specified only at a finite number of points. 
Many practical and formal results concerning p-boxes already exist in the literature. In this 
paper, we provide new efficient tools to construct multivariate p-boxes and develop algorithms 
to draw inferences from them. For this purpose, we formalise and extend the theory of p-boxes 
using Walley's behavioural theory of imprecise probabilities, and heavily rely on its notion of 
natural extension and existing results about independence modeling. In particular, we allow 
p-boxes to be defined on arbitrary totally preordered spaces, hence thereby also admitting 
multivariate p-boxes via probability bounds over any collection of nested sets. We focus on the 
cases of independence (using the factorization property), and of unknown dependence (using 
the Frechet bounds), and we show that our approach extends the probabilistic arithmetic 
of Williamson and Downs. Two design problems — a damped oscillator, and a river dike — 
demonstrate the practical feasibility of our results. 



1. Introduction 

Imprecise probability |41] refers to uncertainty models applicable in situations where the 
available information does not allow us to single out a unique probability measure for all random 
variables involved. Examples of such models include 2- and n-monotone capacities (Jjl, lower 
and upper previsions [UJ EH HI] , belief functions [35] , credal sets [3T] , possibility and necessity 
measures [23l [6] , interval probabilities [42] , and coherent risk measures [IJ [15] . 

Unlike classical probability models, which are described by probability measures, imprecise 
probability models require more complex mathematical tools, such as non-linear functionals and 
non-additive measures |41j . It is therefore of interest to consider particular imprecise probability 
models that yield simple mathematical descriptions, possibly at the expense of generality, but 
gaining ease of use, elicitation, and graphical representation. 

One such model is considered in this paper: pairs of lower and upper distribution functions, 
also called probability boxes, or briefly, p-boxes [Ml Hi] . P-boxes are often used in risk or safety 
studies, in which cumulative distributions play an essential role. Many theoretical properties 
and practical aspects of p-boxes have already been studied in the literature. Previous work 
includes probabilistic arithmetic 145,, which provides a very efficient numerical framework for 
particular statistical inferences with p-boxes (and which we generalise in this paper). In |26) . 
p-boxes are connected to info-gap theory [5] . The relation between p-boxes and random sets was 
investigated in [30J and applied in [35]: many results and techniques applicable to random sets 
are also applicable to p-boxes. Finally, a recent extension of p-boxes to arbitrary finite spaces |19j 
yields potential application of p-boxes to a much more general set of problems, such as robust 
design analysis [20j [28] , and signal processing [21] . 
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Iii this paper, we study p-boxes within the framework of the theory of coherent lower pre- 
visions. Coherent lower previsions were introduced by Williams |43j as a generalisation of de 
Finetti's work [TT], and were developed further by Walley [5T]. Coherent lower previsions gen- 
eralize many of the other imprecise probability models in the literature, and are equivalent to 
closed convex sets of finitely additive probability measures. 

Studying p-boxes by means of lower previsions has at least two advantages: 

• Lower previsions can be defined on arbitrary spaces, and thus enable p-boxes to be used 
for more general problems, and not just problems concerning the real line. 

• Lower previsions come with a powerful inference tool, called natural extension, which 
reflects the least-committal consequences of any given assessments. Natural extension 
generalises many known extensions, including for instance Choquet integration for 2- 
monotone measures. In this paper, we study the natural extension of a p-box, and we 
derive a number of useful expressions for it. This leads to new numerical tools that 
provide exact inferences on arbitrary events, and even on arbitrary (bounded) random 
quantities. 

From the point of view of coherent lower previsions, p-boxes have already been studied briefly 
in 41, Section 4.6.6] and [39 . Lower and upper distribution functions associated with a sequence 
of moments have also been considered |33j . 

As already mentioned, Q15] extended p-boxes to finite totally preordered spaces. In this paper, 
we extend p-boxes further to arbitrary totally preordered spaces. Our generalisation has many 
useful features that classical p-boxes do not have: 

• We encompass, in one sweep, p-boxes defined on finite spaces, as well as (continuous) 
p-boxes on closed real intervals. 

• Perhaps even more importantly, as we do not impose anti-symmetry on the ordering, 
we can also handle product spaces by considering an appropriate total preorder — for 
instance, one induced by a metric — and thus also admit multivariate non-finite p-boxes, 
which have not been considered before. Whence, we can specify p-boxes directly on the 
product space. Contrast this with the usual multivariate approach to p-boxes, such as 
probabilistic arithmetic |45j , that consider one marginal p-box per dimension and draw 
inferences from a joint model built around some information about variable dependencies 
(of course, we can still do the same, and will do so in Section [7]). 

• Our approach is also useful in elicitation, as it allows uncertainty to be expressed as 
probability bounds over any collection of (possibly multivariate) nested sets — also see 
[28l H"8] for a discussion of similarly constructed models. Thus, unlike classical p-boxes, 
we are not restricted to events of the type [—00, x], even on the real line. 

Our approach is thus rather different, and far more general, than the one usually considered 
for inferences with p-boxes. Indeed, we first define a joint p-box over some multivariate space of 
interest, either directly or by using marginal models and a dependence model, after which we draw 
exact inferences from this joint p-box using natural extension. In contrast, usual methods (36) 
such as for instance probabilistic arithmetic |45) start out with marginal p-boxes each defined on 
the real line, and provide tools to make inferences for specific multivariate events. 

The paper is organised as follows: Section [2] provides a brief introduction to the theory of 
coherent lower previsions, used in the rest of the paper. Section [3] then introduces and studies 
the p-box model from the point of view of lower previsions. Section 2] provides an expression 
for the natural extension of a p-box to all events, via the partition topology induced by the 
equivalence classes of the preorder and additivity on full components. Section [5] studies the 
natural extension to all gambles, via the Choquet integral. Section [5] studies an important special 
case of p-boxes whose preorder is induced by a real-valued mapping, as this will usually be the 
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most convenient way to specify a multivariate p-box. Section [7] discusses the construction of 
such multivariate p-boxes from marginal coherent lower previsions under arbitrary dependency 
models. For two important special cases — epistemic independence, and completely unknown 
dependence (the Frechet case) — closed expressions are derived. We also derive probabilistic 
arithmetic as a special case of our approach. Section [5] demonstrates the theory with some 
examples. As a first example, we infer bounds on the expectation of the damping ratio of a 
damped harmonic oscillator whose parameters are described by a multivariate p-box. As a second 
example, we derive the expected overflow height for a river dike, again using a multivariate p-box. 
Finally, Section [9] ends with our main conclusions and open problems. 



We start with a brief introduction to the imprecise probability models that we shall use in 
this paper. We refer to [I], [13] and [JT] for more details. See also [32 for a brief summary of 
the theory. 

Let f2 be the possibility space. A subset of f2 is called an event. A gamble on SI is a bounded 
real- valued function on ft. The set of all gambles on SI is denoted by C(£l), or simply by L if 
the possibility space is clear from the context. A particular type of gamble is the indicator of an 
event A, which is the gamble that takes the value 1 on elements of A and the value elsewhere, 
and is denoted by I a, or simply by A if no confusion with A as event is possible. 

A lower prevision P is a real-valued functional defined on an arbitrary subset JC of £, and 
is interpreted as follows: for any gamble / in JC and any e > 0, the transaction / — P(f) + e 
is acceptable to the subject who has assessed this lower prevision. Hence, lower previsions 
summarize a subject's supremum buying prices for a collection of gambles, and it can be argued 
that in this way they model a subject's belief about the true state x in ft (see [H] for a more 
in-depth explanation). A lower prevision defined on a set of indicators of events is usually called 
a lower probability. 

By P, we denote the conjugate upper prevision of P: for every / such that — / e JC, P(f) = 
— P(— /)■ The upper prevision P(f) can be interpreted as a subject's infimum selling price for 



A lower prevision on JC is called coherent (see [43j p. 5] and J4TJ pp. 73-75, Sec. 2.5]) when 
for all p in N, all /q, /i, . . . , f p in JC and all non- negative real numbers Aq, Ai, . . . , A p , 



A lower prevision on the set C of all gambles is coherent if and only if (see [43j p. 11, Sec. 1.2.2] 
and gU p. 75, Sec. 2.5.5]) 




for all gambles /, g and all non- negative real numbers A. 

A functional on C satisfying P(f) ^ inf / and P(f + g) = P(f) +P(g) for any pair of gambles 
/ and g is called a linear prevision on C |411 p. 88, Sec. 2.4.8], and the set of all linear previsions 
on C is denoted by V . A linear prevision is the expectation operator with respect to its restriction 
to events, which is a finitely additive probability. 

An alternative characterisation of coherence via linear previsions goes as follows. Let P be a 
lower prevision on JC and let A4 (P) denote the set of all linear previsions on £ that dominate P 



2. Preliminaries 



/■ 



sup J]Xi(fi(x)-P(fi))-Mfo(x)-P(fo)) >0. 




(CI) £(/)>inf/, 

(C2) P(A/) = AP(/), and 

(C3) P(f + g)>P(f) + P(g) 



on JC: 



M(P) = {QeV: (V/ e K)(Q(/) > P(f))} 
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Then P is coherent if and only if P agrees with the lower envelope of M(P) on fC, that is, if 
and only if P(f) = mm QeM (£> Q(f) for all / 6 K (see [H p. 18] and [41, p. 138, Sec. 3.3.3]). 
A consequence of this is that a lower envelope of coherent lower previsions is again a coherent 
lower prevision. 

Given a coherent lower prevision P on /C, its natural extension [411 Chapter 3] to a larger set 
/Ci of gambles (fCx 2 JC) is the pointwise smallest coherent lower prevision on JCi that agrees with 
P on JC. Because it is the pointwise smallest, the natural extension is the most conservative (or 
least-committal) coherent extension, and thereby reflects the minimal behavioural consequences 
of Pon JCi. 

Taking natural extension is transitive [39l p. 98, Cor. 4.9]: if E_ Y is the natural extension of P 
to JCi and E_ 2 is the natural extension of E_ x to JC% 12 JCi, then E_ 2 is a l so the natural extension 
of P_ to K-2- Hence, if we know the natural extension of P to the set of all gambles then we also 
know the natural extension of P to any set of gambles that includes JC. The natural extension 
to all gambles is usually denoted by E. It holds that E(f) = minQ e M(P) Q(f) for any / e £ [3TJ 
p. 136, Sec. 3.4.1]. 

A particular class of coherent lower previsions of interest in this paper are completely monotone 
lower previsions [101 [9]. A lower prevision P defined on a lattice of gambles JC, i.e., a set of 
gambles closed under point-wise maximum and point- wise minimum, is called n- monotone if for 
all p e N, p ^ n, and all /, fx, . . . , f p in JC: 



In this section, we introduce the formalism of p-boxes defined on totally preordered (not 
necessarily finite) spaces, as well as the field of events H, which will be instrumental to study the 
natural extension of p-boxes. Hence, in contrast to the work by [24] . we do not restrict p-boxes 
to intervals on the real line. 

Let (ft, <) be a total preorder: so < is transitive and reflexive and any two elements are 
comparable. We write x < y for x < y and x y, x > y for y < x, and x ^ y for x < y and 
y < x. For any two x, y e CI exactly one of x < y, x ~ y, or x > y holds. We also use the 
following common notation for intervals in 57: 



and similarly for [x,y) and (x, y]. 

For simplicity, we assume that ft has a smallest element On and a largest element In- This is 
no essential assumption, since we can always add these two elements to the space fl. 

A cumulative distribution function is a mapping F : ft — > [0, 1] which is non-decreasing and 
satisfies moreover F(1q) = 1. For each x e ft, we interpret F(x) as the probability of the interval 
[On, x]. Note that we do not impose F(0q) = 0, so we allow {On} to carry non-zero mass, which 
happens commonly if ft is finite. Also note that distribution functions are not assumed to be 
right-continuous — this would make no sense since we have no topology defined yet on ft — but 
even if there is a topology for f2, we make no continuity assumptions. 




A lower prevision which is n-monotone for all n e N is called completely monotone. 



3. P-Boxes 



[x, y] = {z e ft : x < z < y} 
(x,y) = {z 6 ft : x < z < y} 



P-BOXES ON TOTALLY P REORDERED SPACES FOR MULTIVARIATE MODELLING 



5 



By 0/ ~ we denote the quotient set of f2 with respect to the equivalence relation ~ induced 
by <, that is: 

[x]~ = {y e fl : y ~ x} for any x e Q, 
n/~ = {[x]„: xefl} 
Because F is non-decreasing, F is constant on elements [x]~ of fi/ ~. 

Definition 1. A probability box, or p-box, is a pair (F_, F) of cumulative distribution functions 
from 57 to [0, 1] satisfying F ^ F. 

Note that some definitions, such as in 19 , differ in that they use the coarsest preorder for 
which F_ and F are non-decreasing, effectively imposing that x ~ y if and only if F_(x) = F(y) 
and F(x) = F(y). This paper follows [IT], only imposing that F_ and F are non-decreasing, so 
a; ~ y implies = and F(x) = F(y), but not the other way around, thereby admitting 

more preorders, possibly leading to tighter bounds (see Example [B] further on). 

A p-box is interpreted as a lower and an upper cumulative distribution function. In Walley's 
framework, this means that a p-box is interpreted as a lower probability P_ F -p on the set of 
events 

/C = {[0 n ,z]:zen}u{(j/,lfi]:yeO} 

E F<F ([0n,x]) = F(x) and P FtF ((y, In}) = 1 - F(y). 

In the particular case of p-boxes on [a, b] c R it was mentioned by [HJ Section 4.6.6] and proven 
by [39j p. 93, Thm. 3.59] that P_ F F is coherent. More generally, it is straightforward to show 
that p-boxes on an arbitrary totally preordered space (fl, <) are coherent as wellQ We denote by 
E_ F -p the natural extension of P_ F y to all gambles. We study this natural extension extensively 
further on. 

When F = F, we say that (F, F) is precise, and we denote the corresponding lower prevision 
on K, by P F and its natural extension to L by E_ F (with F := F_ = F). 
A few examples of p-boxes on [0, 1] are illustrated in Fig[T] 

Given a p-box, we can consider the set of distribution functions that lie between F_ and F, 

®(M,F) = {F: F^ F ^F} . 

We can easily express the natural extension E_ F F of P_ F F in terms of $(£, F): E_ F F is the lower 
envelope of the natural extensions of the cumulative distribution function^ F that lie between 
F and F: 

E FF (f)= mi_E F (f) (1) 

— ' Fe$(F.F) 

for all gambles / on ft. A similar result for p-boxes on [0, 1] can be found in [1TJ Section 4.6.6] 
and [331 Theorem 2]. To see why this holds, note that P_ F y is the lower envelope of the set 
■M{P FF ) of linear previsions dominating P_ F y, because E_ F y is a coherent lower probability. 
Each of the linear previsions Q in A4(P F F ) has a cumulative distribution function Fq, and it is 
easy to see that Fq e ${F, F). Conversely, any linear prevision whose cumulative distribution 
function F belongs to $(F, F) must belong to M(P FF ). Therefore, the set M(P_ FF ) coincides 

with the set of all linear previsions whose cumulative distribution function belongs to F), 
which establishes Eq. (fTJ). We shall use this equation repeatedly in subsequent proofs. This 

^The proof is virtually identical to the one given in 1391 p. 93, Thm. 3.59]. 

2 The natural extension of a cumulative distribution function F is simply understood to be the natural extension 
of the precise p-box (F, F). 



6 



M. TROFFAES AND S. DESTERCKE 





1^0 1 



Figure 1. Examples of p-boxes on [0, 1]. 

allows us moreover to give a sensitivity analysis interpretation to p-boxes: we can always regard 
them as a model for the imprecise knowledge of a cumulative distribution function. 
We end this section with a trivial, yet very useful, approximation theorem: 

Theorem 2. Let P be any coherent lower prevision defined on C The least conservative p-box 
(F_, F) on (CI, <) whose natural extension is dominated by P is given by 

F(x) = P([0n,x])) F(x) = P([0 n , x]) 

for all x e CI. 

Proof. Obviously, the natural extension E_ F F of the p-box, with F_ and F as above, is dominated 
by P, because P is an extension of P_ F F (see for instance [39, p. 98, Prop. 4.7]). 
Any other p-box (G, G) whose natural extension is dominated by P must satisfy: 

G(x) =Ec^([0 n ,x]) =E^([0 a ,x]) <P([0 n ,i]) = F(x) 

G(x) = 1 - Pg,g(& 1]) = Eg,g(\- ^ *]) > P([°n, x]) = F(x) 

so (F_, F) is indeed the least conservative one. □ 

4. Natural Extension to All Events 

The remainder of this paper is devoted to the natural extension E_ F -p of P_ F F , and to various 
convenient expressions for it. We start by giving the form of the natural extension on the field 
of events generated by K. 

4.1. Extension to the Field Generated by the Domain. Let T~L be the field of events 
generated by the domain /C of the p-box, i.e., events of the type 



[0n>2Cl] U (X2,X 3 ] U ■ • ■ U (x 2n ,X2n+l] 
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for x± < X2 < X3 < ■ ■ ■ < X2n+i in ^ (if n is we simply take this expression to be [On, x{\) and 

(ac 2 ,x 3 ] w ••• u (x 2 n,Z2n+i] 

for X2 < 2:3 < • ■ ■ < X2n + i in i7. Clearly, these events form a field: the union and intersection of 
any two events in H is again in H, and the complement of any event in T-L also is again in %. 

To simplify the description of this field, and the expression of natural extension, we introduce 
an element On — such thatQ 

On — < x for all x e fl 
F(On-) = £(0 n -) = F(On-) = 
In particular, (On— ,x] = [On, 2;]. If we let f2* = f2 u {On — }, then 

H = {(x ,xi] u (x 2l x 3 ] u • • • u (x2n,x 2n+ i] : x < Xi < ■ ■ ■ < x 2n +i in O*}. (2) 

Since the procedure of natural extension is transitive, in order to calculate the natural exten- 
sion of P F -p to all gambles, we shall first consider the extension from JC to T-L, then the natural 
extension from H to the set of all events, and finally the natural extension from the set of all 
events to the set of all gambles. 

In the case of a precise p-box, P_ F has a unique extension to a finitely additive probability 
measure on T-L. 

Proposition 3. E_ F restricted to % is a finitely additive probability measure. Moreover, for any 
A e %, that is A = (xo, Xi] u (£2, £3] u ■ • ■ u (x2 n , £ 2 n+i] with xq < x\ < ■ ■ ■ < X2 n +i in fi*, it 
holds that 

n 

E F (A)=Y,(F(x2 k+1 )-F(x2 k )) (3) 

Proof. Because 

E F ([0 n ,x])=P F ([0n,x]) = F(x) and 

E F ([0 n ,x]) = l-E F ((x,l n ]) = 1 - P F {{x, In]) = F(x) 

for all x, E_ F is linear on the linear space spanned by {[On, a;]: x e ft} P- 102, Prop. 4.18]. 
This linear space includes Ji, which proves that E_ F is additive on T-i, and consequently it is a 
finitely additive probability measure on H. The expressions for E_ F (A) follow immediately. □ 

We now extend Proposition [3] to p-boxes. 

Proposition 4. For any A e H, that is A = (xo,Xi] u (2:2,0:3] u ••• u (o: 2ra , X2n+i] with xq < 
X\ < ■ ■ ■ < X2n+i in it holds that E_ FF -(A) = P^-piA), where 

n 

P-E,f (A ^ = E Hiax{0,£(a;2 fe+ i) -F(x 2k )}. (4) 

k=Q 

Proof. We first show that E_ FF -{A) ^ P^-p(A). Consider a cumulative distribution function F 
in $(£, F) which satisfies 

F(x 2 k) =F(x 2k ) 
F(x 2 k+i) = max{F(x 2 k+i), F(x 2 k)} 

^The cunning reader notes that one could avoid introducing On — by imposing F(Qn) = F(0n) = 0. However, 
this leads to an apparent loss of generality when linking p-boxes to other uncertainty models, and a slightly more 
complicated equation for the natural extension — therefore, for our purpose, it is simpler to stick to the most 
general formulation. (F_(Iq) — F(Iq) — 1 follows from coherence, so nothing is lost there.) 
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for all k = Q,...,n. Note that ^ F(xq) ^ ■ ■ • P(#2n+i) 1) so there is a cumulative 
distribution function satisfying the above equalities. Secondly, note that for each k = 0, . . . , 2n + 
1, F satisfies F_(xk) f i F(xk) f i F{xk)- Hence, there is indeed a cumulative distribution function 
F in $(P, F) satisfying the above equalities. By Eq. (|3J), 

n 

E F {A) = J] mnx{F(x2k+i),F(x 2 k)} -F(x 2k ) 
k=a 

n 

= J] max{0,F(x2k+i) -F(x 2 k)} = P^p(A) 

fc=0 

with respect to F. Using Eq. (JTJ, we deduce that P]ip{A) ^ E FF (A). 

Next, we show that E_ F p{A) ^ P^ p{A). Let F be any cumulative distribution function in 
$(F,P). Then, 

E F ((x 2 k, x 2 k+i]) = F(x 2 k+i) - F(x 2 k) J-* max{0, F(x 2fe+ i) - P(x2fc)} 

since i^a^fe+i) P(^2fc+i) and —F(x2k) ^ — F(x2k)- But, because _Ep is a finitely additive 
probability measure on "H (Proposition |3J, P^p{A) sC ^ F (A). This holds for any F in $(P,F), 

and hence Eq. JT]) implies that P FF -(A) s£ E FF (A). □ 

Note that it is possible to derive a closed expression for the upper prevision E F p as well, 
similar to Eq. ((U), however that expression is not as easy to work with. In practice, to calculate 
E F p(A) for an event A e 7-1, it is by far easiest first to calculate E_ F p(A c ) using Eq. Qj (observe 
that also A c e H), and then to apply the conjugacy relation: 

Epp(A) = l-Epp(A c ). 

The lower probability P F p on IC determined by the p-box usually does not have a unique 
coherent extension to the field H (unless F_ = F), as shown in the following example. 

Example 5. Consider the distribution functions F_ and F given by F(x) = x for x e [0, 1] and 
F(x) = if x s: 0.5, F(x) = 2(x - 0.5) if x ^ 0.5. 

From Proposition [31 both P_ F and Pp have a unique extension to the field H. Let us define 
Pj on H by P_x(^4) := min{P F (/l), P_p(A)} for all A Then P_ x is a coherent lower prevision on 
W, and it is not difficult to show that Pj = P F F (^4) for any Ae tC. 

Now, given the interval (0.5, 0.6], we deduce from Proposition [3] that 

£i ((0.5, 0.6]) = min{P F ((0.5,0.6]),P F ((0.5,0.6])} 

= min{P(0.6) - P(0.5),P(0.6) -P(0.5)} = min{0.2,0.1} = 0.1. 

However, it follows from Proposition g] that E F F ((0.5, 0.6]) = max{0,P(0.6) - P(0.5)} = 
max{0,0.2-0.5} = 0. 

4.2. Inner Measure. The inner measure P F F of the coherent lower probability P F F defined 
in Eq. flU coincides with E FF on all events [HJ Cor. 3.1.9, p. 127]: 

E F p(A)=pK F (A)= sup P%(C), (5) 

— -' * CeW,CcA -' 

The next example demonstrates that the choice of the preorder < can have a significant 
impact, even for the same F and F. 
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Example 6. Take fl = {0, 1, 2, 3, 4} and consider: 

'0 i£x<2, 



F{x) = F(x) 



1 otherwise. 



Consider the total preorder <i defined by 1 <i 2 3 4 and the usual total ordering 
<2 defined by <2 1 <2 2 <2 3 <2 4. With <i, we have for any event A c Q that 



using Eqs. (0| and ([5]). However, with <2, 



,1 if {2,3,4} c A, 
otherwise. 



— I otherwise. 

For ease of notation, from now onwards, we denote E_ F p by E_ when no confusion about the 
cumulative distribution functions determining the p-box can arise. 

In principle, the problem of natural extension to all events is solved: simply calculate the 
inner measure as in Eq. ([5]), using Eq. (QJ to calculate P^-p(C) for elements C in %. However, 
the inner measure still involves calculating a supremum, which may be non-obvious. What we 
show next is that Eq. Q can be extended to arbitrary events, by first taking the topological 
interior with respect to a very simple topology, followed by a (possibly infinite) sum over the 
so-called full components of this interior. 

4.3. The Partition Topology. Consider the partition topology on fl generated by the equiva- 
lence classes \x\~, that is, the topology generated by 

T := {[x]~ : x 6 f2}. 

This topology is very similar to the discrete topology, except that it is not Hausdorff, unless < 
is anti-symmetric: if x ~ y but x y then x and y cannot be topologically separated, since 
every neighborhood of x is also a neighborhood of y and vice versa. If < is anti-symmetric (for 
example, the usual ordering on the reals is), then the partition topology reduces to the discrete 
topology, that is, every set is clopen (closed and open). 

The open sets in this topology are all unions of equivalence classes (or, subsets of 0/ ~, if 
you like). Hence, in this topology, every open set is also closed. In particular, every interval in 
(Q, <) is clopen. 

The topological interior of a set A is given by the union of all equivalence classes contained 
in A: 

mt(A) = \J{[x] = :[x] s ^A) (6) 

and the topological closure is given by the union of all equivalence classes which intersect with 
A: 

c\(A)={J{[xU: [x] s nA#0}. (7) 
Lemma 7. For any subset A of CI, E(A) = E(mt(A)) and E(A) = E(cl(A)). 

Proof. Clearly E(mt(A)) si E{A) because int(A) c A. If we can also show that E(m.t(A)) ^ 
E_(A) the desired result is established. 

Consider an element C of the field 7i which is included in A. Since C is in particular an 
open set in the partition topology, it is a subset of 'mt(A). The monotonicity of E_ implies that 
E(C) ^ E(mt(A)). Consequently, 

E(A) = P% T (A)= sup P^p{C)= sup E{C) ^ E(xnt(A)). 
— ' * Ce«,CcA — ' CeH,CcA 
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The equality E(A) = E(cl(A)) now follows from the fact that cl(A) = (int(A c )) c and E(A) = 
1-E(A C ) □ 

4.4. Additivity on Full Components. Next, we determine a constructive expression of the 
natural extension E_ on the clopen subsets of CI. 

Definition 8. [37J §4.4] A set S c Cl is called full if [a, 6]c5 for any a < b in S. 

What do these full sets look like? Obviously, any full set is clopen, as it must be a union of 
equivalence classes. 

Lemma 9. Every full set is clopen. 

Proof. Observe that [a, a] = [a]~, and apply Definition [5] □ 

Under an additional completeness assumption, the full sets are precisely the intervals. 

Lemma 10. LfCl/ ~ is order complete, that is, if every subset ofCl/ ~ has a supremum (minimal 
upper bound) and infimum (maximal lower bound), then every full set is an interval, that is, it 
can be written as \x,y\, \x,y), (x,y\, or (x,y), for some x, y in Cl. 

Proof. Consider a full set S in Cl. Since, by Lemma|9l S is clopen, we may consider S as a subset 
of Cl/ ~. So [x]~ = inf S and [y]~ = sup S exist, by the order completeness of Cl/ ~. Consider 
the case x e S and y $ S. Apply the definitions of inf and sup to establish that z e S if and only 
if x < z < y. The other three cases are proven similarly. □ 

Note that Q/ ~ can be made order complete via the Dedekind completion [371 §4.34]. 

Definition 11. j37j §4.4] Given a clopen set A c. CI and an element x of A, the full component 
C(x, A) of X in A is the largest full set S which satisfies x e S Q A. 

Lemma 12. The full components of any clopen set A form a partition of A. 

Proof. This is easily shown using a similar result for total orders given in [37J 4.4(a)]. □ 

In the following theorem, we prove that the natural extension E_ is additive on full components. 
Recall that the sum of a (possibly infinite) family (x\)\ e \ of non- negative real numbers is defined 
as 

^ x\ = sup ^] x x 

AeA £ L fi£tte Aei 

If the outcome of the above sum is a finite number, at most countably many of the xa's are 
non-zero [37, 10.40]. 

Theorem 13. Let B be a clopen subset of CI. Let (Bx)xeA be the full components of B, and let 
(Ca)aeA' be the full components of B c . Then 

mm = 2 mbx) 

AsA 



E(B) = 1-^ 



AsA' 



Proof. Since (-Ba)asA is a partition of B, and because the functional E_ is monotone and super- 
additive, 

E(B) = S(uasaS a ) ^ S(uaslSa) =5 V E{Bx) 



for every finite subset L of A, and consequently E_(B) ^ XasA Mi^x)- We are left to show that 
also E(B) ^y] XEA E(Bx). 
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Let e > 0. By Eq. ((5|), the inner measure coincides with the natural extension, so there is a C 
in H such that C c B and E(B) ^ E(C) + e. From Eq. ©, we can write 

C = (Xq, Xi] U (X2,X 3 ] U ■ ■ ■ U (x 2n ,X2n + l] 

for some xq < x\ < ■ ■ ■ < X2 n +i in Let us denote Ck ■= (x2k,X2k+i] for k = 0, . . . , n. It is 
easily established that Co, . . . , C n are the full components of C. 

Applying Proposition 2] twice, and using the fact that each full component Ck of C must be 
a subset of exactly one full component B\ of B (this is a consequence of C c £?) , we find that 

n 

E{C) = ^ £(C fc ) = J] £(u fc: c,c SA C fe ) 

fc=0 AeA 

(note that the union is for those A where no Ck £ B\ — so only a finite number of terms can 
be non-zero in the latter sum) and consequently 

E{B) ^ R(C) + e = Y> M^k: c k ^B x C k ) + e ^ £ ^ B x) + e, 
AeA AeA 

taking into account the monotonicity of E_. As this holds for any e > 0, we arrive at the desired 
inequality. 

The expression for E simply follows from the conjugacy relation E(B) = 1 — E_(B C ), once 
noted that B c is clopen as well. □ 

In other words, the natural extension E_ of a p-box is arbitrarily additive on full components. 
In particular, interestingly, it is er-additive on full components (but obviously not additive, let 
alone cr-additive, on arbitrary events). 

Example 14. Additivity on full components is not sufficient for a lower probability to be equiv- 
alent to a p-box, even in the finite case. For example, consider f2 = {1,2,3} with the usual 
ordering, so H = p(fl). Let P be the lower probability defined by 

£({!}) =£({2}) =£({3}) =0.1 

It can be checked that P is coherent, and that the natural extension E_ of P is the lower envelope 
of the probability mass functions (0.8,0.1,0.1), (0.1,0.8,0.1), and (0.1,0.1,0.8). Moreover, E is 
additive on full components, because 

£({1,3}) =£({1}) +£({3}) 

(every other subset of f2 is already full). However, 

S({2})^max{0,£({l,2})-^({1})} 

because £({1,2}) = 0.2 and E({1}) = 0.8. This shows that £ violates Proposition 0] and as a 
consequence it is not the natural extension to events of a p-box. 

4.5. Summary. Let us summarize all results so far, and explain how, in practice, E_(A) and 
E(A) of an arbitrary event A can be calculated. 

Proposition [4] gave the natural extension to the field H; we are now in a position to generalize 
it to all events, at least when ft/ ~ is order complete. 

Indeed, consider an arbitrary event A. By Lemma [71 it suffices to calculate the natural 
extension of hit (A) or cl(A). Calculating the interior or closure with respect to the partition 
topology will usually be trivial (see examples further on), and moreover, the topological interior 
or closure of a set is always clopen, so now we only need to know the natural extension of clopen 
sets. 

Now, by Theorem[l]2 it follows that we only need to calculate the natural extension of the full 
components (-Ba)a6A of int(A) or the full components (Ca)asA of c\(A) c = int(A c ) — note that 
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each of these full components is also clopen. Also, finding the full components will often be a 
trivial operation — commonly, there will only be a few. 

But, by Lemma 1101 if, in addition, Cl/ ~ is order complete, then each full component is an 
interval. And for intervals, we immediately infer from Proposition [4] and Eq. ([5]) that: 

E{{x,y]) = max{0,£(|/) -F(x)} (8a) 
E((x,y)) = max{0,F(y-)-F(x)} (8b) 

(max{0,F(y) — F(x)} if x has no immediate predecessor 
' I max{0, F(y) — F(x— )} if x has an immediate predecessor 

^ J max{0, F_(y— ) — F(x)} if x has no immediate predecessor 
' I max{0, F_(y— ) — F(x— )} if a; has an immediate predecessor 

for any x < j/ in 570 where F_(y—) denotes sup 2<y £(z) and similarly for F(x-). The equalities 
hold because, if x < y in Cl, and x— is an immediate predecessor of x, then [x, ?/] = (a;—, y] and 
[a;,?/) = (x—,y). Recall also that FJfin— ) = F(0q—) = by convention. If Cl/ ~ is finite, then 
one can think of 2— as the immediate predecessor of z in the quotient space 57/ ~ for any z e 57; 

In other words, we have a simple constructive means of calculating the natural extension of 
any event. 



4.6. Special Cases. The above equations hold for any (57, <) with order complete quotient 
space. In most cases in practice, either 

• CI/ ~ is finite, or 

• CI/ ~ is connected meaning that for any two elements x < y in CI there is a z in CI such 
that x < z < t/0 (this is the case for instance when 57 is a closed interval in K and < is 
the usual ordering of reals) 

Moreover, if CI/ ~ is connected, then F_ will commonly satisfy F_(y—) = E_(y) for all y in CI. For 
example, in case Cl is a closed interval in M, this happens precisely when £(0) = and -F is 
left-continuous in the usual sense. 

Obviously, if Cl/ ~ is finite, then every element of Cl has an immediate predecessor (remember, 
we take the immediate predecessor of Oq to be On—), and if Cl/ ~ is connected, then no element 
except On has an immediate predecessor. 

By Lemma [71 Theorem 1131 and Eqs. (JSJ), we conclude: 

Corollary 15. IfCl/ ~ is finite, then every full set B is of the form [a, b], and 

E(A) = ^ max{0,F(6 A ) -F(a x -)} 
AeA 

E(A) = 1 " E max{0,£(6' A ) -F(a' x -)} 
AeA' 

where ([a\,b\])\ e \ are the full components o/int(^4), and ([a' A , b' x ])\ e \i are the full components 
of mt(A c ) = cl(A) c . 



^In case x = 0q, evidently, On — is the immediate predecessor. 

^This terminology stems from the fact that, in this case, Q/ ~ is connected with respect to the order topology 
[33 §15.46(6)]. 
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Corollary 16. If£l/ ^ is order complete and connected, and F_(y—) = F(y) for all y in fl, then 

E(A) = J] max{0,£(supB A ) -F(w£ B x )} 
AeA 

E(A) = 1-2 max{0,£(supC A ) -F(inf C\)} 

AsA' 

where (-Ba)asA are the full components o/int(^4) and (Ca)asA' are the full components of'mt(A c ) = 
cl(A) c . 

Beware of F_(0q) = F_(0q—) = in the last corollary. 

4.7. Example. Let's investigate a particular type of p-boxes on the unit square [0, l] 2 . First, 
we must specify a preorder on Q. A natural yet naive way of doing so is, for instance, saying 
that (2:1,2/1) < (0:2,2/2) whenever 

xi + yi sS x 2 + y2 

Consider a p-box (F_,F) on ([0, l] 2 , <). Since F_ is required to be non-decreasing with respect 
to <, it follows that F_(x, y) is constant on elements of [0, l] 2 / ~, which means that F{x\,y\) = 
F{x2,y2) whenever x\ + y% = x-i + y%. Thus, we may think of F_(x, y) as a function of a single 
variable z = x + y, and we write F_(z). Similarly, we write F(z). 

Our definition of < means that our p-box specifies bounds on the probability of right-angled 
triangles (restricted to [0, l] 2 ) whose hypothenuses are orthogonal to the diagonal: 

F(z) *S p({(x, y) e [0, l] 2 : x + y ^ z}) ^ F(z) (9) 

Observe that the p-box is given directly on the two-dimensional product space, without the need 
to define marginal p-boxes for each dimension. The base r for our partition topology is given by 

T = {{(x,y)e[0,l] 2 :x + y = z}: ze[0,2]} 

For example, the topological interior of a rectangle A = [a, b] x [c, d] is empty, unless a = c = 
or b = d = 1, because in all other cases, no element of r is a subset of A. In the cases where 
a = c = and min{6, d} < 1, or maxjo, c} > and b = d = 1 (if a = c = and b = d = 1 then 
the interior is fi), respectively, we have: 

int([0,6] x [0,d]) = {(x,y) e [0, l] 2 : x + y s= min{6,d}} 

int([a, 1] x [c, 1]) = {(x, y) e [0, l] 2 : x + y ^ 1 + maxjo, c}} 

Consequently, E_(A) = for all rectangles A, except for 

E([0,6] x [0,d]) = F(mm{b,d}) 

E([a, 1] x [c, 1]) = 1 - F(l + maxjo, c}) 

Fig- HI illustrates the situation. So, for the purpose of making inferences about the lower proba- 
bility of events that are rectangles, the ordering < was obviously poorly chosen. In general, one 
should choose < in a way that £1/ ~ contains good approximations for all events of interest. 

For example, in the case of rectangles, we could for instance discretize [0, l] 2 into smaller 
squares, and impose some ordering on these squares. Of course, it may not be entirely obvious 
how to interpret the lower and upper cumulative distribution functions on such discretized space, 
since there is no natural ordering on such discretization. Another strategy would be to start 
from a reference point (e.g., an elicited modal value) and then to choose the ordering < such 
that intervals correspond to concentric regions of interests around the reference point. Again, all 
of this is possible because our theory concerns p-boxes on arbitrary totally prcordercd spaces, 
and is not limited to the real line with its natural ordering. More realistic examples in which 
such concentric regions are used are given in Section [5J 
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a 




□ x + y sC 0.5 6 

j | 0.5 ^ x + y ^ 1.2 [J a; + ?/ ^ min{6, <i} Q x + y ^ 1 + max{a, c} 

□ * + !/ 1-2 

Figure 2. Shape of intervals induced by <, and calculation of the topological interior. 

5. Natural Extension to All Gambles 

Next, we establish that p-boxes are completely monotone, and that therefore their natural 
extension to all gambles can be expressed as a Choquet integral. We further simplify the calcu- 
lation of this Choquet integral via the lower and upper oscillation of gambles with respect to the 
partition topology introduced earlier. 



5.1. Complete Monotonicity. As shown in [33l Section 3.1], the natural extension E_ F of a 
distribution function F on [0, 1] is completely monotone. It is fairly easy to generalise this result 
to distribution functions on an arbitrary totally preordered space However, given this, and 
Eq. (TTJ), we cannot immediately deduce the complete monotonicity of E_ff> because the lower 
envelope of a set of completely monotone lower previsions is not necessarily completely monotone. 
We prove next that such an envelope is indeed completely monotone in the case of p-boxes. This 
is an improvement with respect to previous results [19 , where the relation between p-boxes and 
complete monotonicity was established for finite spaces. 

Let P^-p denote the restriction of E_ F -p to Ti, given by Proposition HJ 

Theorem 17. P^-p is a completely monotone coherent lower probability. 

Proof. Clearly, y is coherent as it is the natural extension to % of a coherent lower probability 
P_ F -p [41, p. 123, 3.1.2(a)]. To prove that it is completely monotone, we must establish that for 
alFp EN,2^p^n, and all A u . . . , A p in H: 

rh(uA> s (-i) |/i+i ^fn^V ( io ) 

V = l / 0^/C{l,...,p} J 

For any p e N, 2 ^ p ^ n, and any A\, . . . , A p in H, consider the finite field generated by Ai, 
. . . , A p . Let Q denote the restriction of E^p to this finite field. By [TU Sec. 3], Q is completely 
monotone on this finite field. In particular, Eq. (fTU|) is satisfied. But this means that Eq. (TlUl) is 
satisfied for all p e N, 2 < p ^ n, and all A\, . . . , A p in Tl, which establishes the theorem. □ 



^Indeed, by [9] Thm. 5 &; Thm. 9], the natural extension of any finitely additive probability on a field is 
completely monotone. 
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5.2. Choquet Integral Representation. Complete monotonicity allows us to characterise the 
natural extension on all gambles, as we show in the following theorem: 

Theorem 18. The natural extension E_ of P_ F j; is given by the Choquet integral 

J -sup/ 
E{{f^t})dt 
in!/ 

for every gamble f . Moreover, E_ is completely monotone on all gambles. Similarly, 

J -sup / 
E({f>t})dt. 
inf/ 

Proof. Immediate from Theorem [T7J once observed that E_ = P^-h on all events, and jjSJJ Theo- 
rems 8 and 9]. The latter two theorems state that: 

• Given a coherent n-monotone (n ^ 2) lower probability P defined on a field (here, T-i), 
its natural extension to all gambles is given by its Choquet integral (C) J • dP_% 

• If a coherent lower probability P_ defined on a field is n- monotone (n ^ 2), then its 
natural extension to all gambles is n-monotone. 

□ 

5.3. Lower and Upper Oscillation. By Lemma [71 to turn Theorem [T51 in an effective algo- 
rithm, we must calculate int({/ ^ t}) for every t. Fortunately, there is a very simple way to do 
this. 

For any gamble / on VL and any topological base r, define its lower oscillation as the gamble 

Qsc(f)(x) = sup inf f(y) 

Cet: xeCV eC 

For the partition topology which we introduced earlier, this simplifies to 



osc(f)(x)= inf f(y) (11) 



The upper oscillation is: 



osc(f)(x) = -osc(-f)(x) = sup f(y) (12) 

For a subset A of ft, we deduce from the above definition and from Eq. © that the lower 
oscillation of I a is A n t(A)j so the lower oscillation is the natural generalisation of the topological 
interior to gambles. Similarly, we see from Eq. ([Jj that the upper oscillation of I a is / c i(A)- 

Proposition 19. For any gamble f on ft, 

int({/ ^ t}) = {osc(f) =s t} (13a) 
cl({/ ^ t}) = {osc(f) ^ t] (13b) 

so, in particular, 

^ sup osc ( f) 

E(f) = Mosctf) + E{{Qsc{f) t}) dt = E{gsc{f)) (14a) 

Jinf osc (f) 
/•sup osc(f) 

E(f) = inf osc(f) + __ E({osc(f) ^ t}) dt = E(osc(f)) (14b) 

Jinf osc(f) 
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Proof. Eqs. (|13[) are easily established using the definitions of interior and closure, and lower and 
upper oscillation. For example, 

x e int({/ > t}) 
if and only if there is a C in t such that x e C and 

But, for our choice of r, necessarily C = [x]~ if x e C e r, so the above holds if and only if 
And this holds if and only if 

osc(f)(x) i 

where we used the defintion of osc ( f). 

Let us prove Eqs. (fH|) . It follows from Eq. (fTTj) that / ^ osc (f), and as a consequence 
£(/) ^ E(osc(f)). We are left to prove that E(f) sC E(osc{f)). 

Indeed, using Lemma [7J and Eqs. (fTU)) . 

J -sup / ,-sup / 

£({/ f }) dt = inf / + E({Qsc(f) > t}) dt 

inf / Jinf / 

and since obviously, by Eq. (jlip . inf / = inf osc ( f ), 

J -sup/ 
E({osc(f) ^ t})dt 
inf osc ( f) 

and since snp/ ^ supo5c(/), using the usual properties of the Riemann integral, 

psup osc ( f) 

= ini osc( f) + M({osc(f) ^ t}) dt 

Jinf osc ( f) 

J -sup/ 
sup osc( f) 

Now use the fact that {osc(/) ^ t} = for t e (sup osc(f), sup /], so the last term is zero. □ 

Concluding, to calculate the natural extension of any gamble, in practice, we must simply 
determine the full components of the cut sets of its lower or upper oscillation, and calculate a 
simple Riemann integral of a monotonic function. 

Examples will be given in Section [S] 

6. P-Boxes Whose Preorders are Induced by a Real- Valued Function 

In practice, the most convenient way to specify a preorder < on 57 such that 51/ ~ is order 
complete and connected is by means of a bounded real- valued function Z : Q — > R. For instance, 
in the example in Section l4~7l we used Z(x, y) = x + y. Also see [2] and [28] . 

Let us assume from now onwards that Z is a surjective mapping from 57 to [0, 1]. 

For any x and y in 51, define x < y whenever Z(x) ^ Z(y). Because Z is surjective, Q/ ~ 
is order complete and connected. In particular, 57 has a smallest and largest element, for which 
Z(0n) = and Z{1q) = 1. Moreover, we can think of any cumulative distribution function on 
(57, <) as a function over a single variable z 6 [0, 1]. Consequently, we can think of any p-box 
on (57, <) as a p-box on ([0, 1], ^). In particular, for any subset / of [0, 1] we write E_(I) for 
MZ^il))- For example, for a, b in [0, 1], and A = Z _1 ((a,6]) c 57, we have that 

E(A) =E((a,b]) = max{0, F(a) -F(b)} 
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by Proposition 21 Similar expressions for other types of intervals follow from Eqs. ([5]). 

The topological interior and closure can be related to the so-called lower and upper inverse 
of Z~ x . Indeed, consider the multi-valued mapping T := Z~ x : [0, 1] — * p(Sl). Because for every 
x in VL, it holds that [x]~ = T(Z(x)), it follows that, for any subset A of f2, int(A) = r(r*(A)), 
and c\(A) = T(r*(^4)), where and T* denote the lower and upper inverse of T respectively, 
that is 

T t (A) = {z £ [0, 1] : T(z) c A}, and 
T*(A) = {ze[O,l]:T(z)nA^0} 

(see for instance [16]lFI 

Theorem 20. Let A be an arbitrary subset of£l. Then 

AeA 

E{A) = 1-2 E(J X ) 

AeA' 

where (I\)\ e A ore the full components of Z(mt(A)) = r*(A) and (Ja)aeA' are the full components 
of Z(mt(A c )) = Z(cl(A) c ) =T4A C ). 

If, in addition, F_ is left- continuous as a function of z e [0,1] and F_(0) = 0, then 

E(A) = Yj max{0, ; F(sup/A) -F(m£I x )} 
AeA 

E(A) = 1-^] max{0,£(sup J A ) -F(inf J A )} 
AeA' 

Proof. Indeed, by Corollary [TBI 

E(A) = E(mt(A)) = Y 1 E(B x ) 
AeA 

where (-Ba) AeA are the full components of int(A). So, the result is established if we can show 
that (Z~ 1 (I\))\ e \ are the full components of int(j4). 

Obviously, since (I x ) Xe A partitions Z(int(A)), it follows that (Z (I x )) Xe \ partitions 

|J Z-\h) = Z-\Z(wk(A))) = mt(A) 

AeA 

where the latter equality follows from the fact that int(A) is clopen, i.e., is a union of equivalence 
classes. 

We are left to prove that each set Z~ l {I\) is a full component. Clearly, Z~ 1 (I\) is full: for 
any two x and y in Z~ 1 (I\), it holds that 

[x,y\ = {ven: Z(x) Z(v) sC Z(y)} = Z-\[Z(x), Z(y)]) c Z~ x (l x ) 

where we used the fact that [Z(x), Z(yJ] c: I x in the last step. 

Consider any x e Z _1 (2a)- The desired result is established if we can show that Z _1 (/a) is 
the largest full set S which satisfies ieSc int(A). 

Suppose S is larger, that is, S is full, ieSc int(A), and Z~ x {l x ) C S. Since both sets are 
clopen, it must be that there is some y e S such that [y]~ n Z (Ix) = 0- But this implies that 

I x = Z(Z-\I X )) C Z(S) 



7 We follow the terminology in 1341 and 1221 . among others. Beware that r$ and T* are sometimes called upper 
and lower inverse instead [10 3 |. or strong and weak inverse |14| . 
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because Z(y) belongs to Z(S) but not to Z(Z^ 1 (I\)). But, this would mean that I\ is not a 
full component of Z{vat(A)) — a contradiction. So, Z~ X (I\) must be the largest full set S which 
satisfies x e S c int(A). □ 

Regarding gambles, note that the lower oscillation is constant on equivalence classes (this 
follows immediately from its definition). Hence, we may consider osc ( f) for a gamble / on $1 as 
a function of z e [0, 1], and we can use Proposition 1191 to write: 

Proposition 21. For any gamble f on SI, 

J .sup osc j f) 
E({z:osc(f)(z)^t})dt 
inf osc (f) 

J>suposc(/) 
E({z:osc(f)(z)^t})dt 
inf osc( f) 

7. Constructing Multivariate P-Boxes from Marginals 

In this section, we construct a multivariate p-box from marginal coherent lower previsions 
under arbitrary rules of combination. As special cases, we derive expressions for the joint, 

(i) either without any assumptions about dependence or independence between variables, that 
is, using the Frechet-Hocffding bounds [25] . 

(ii) or assuming epistemic independence between all variables, that is, using the factorization 
property [7]. 

We also derive Williamson and Downs's [15] probabilistic arithmetic as a special case of our 
framework. 

Specifically, consider n variables X\, . . . , X n assuming values in X\, . . . , X n , and assume that 
marginal lower previsions P ± , . . . , P_ n , are given for each variable — each of these could be the 
natural extension of a p-box, although we do not require this. So, each is a coherent lower 
prevision on C(Xi). 

7.1. Multivariate P-Boxes. The first step in constructing our multivariate p-box is to define 
a mapping Z to induce a preorder < on O = X i x ■ • • x X n . The following choice works perfectly 
for our purpose: 

Z(xi,. ..,!„) = maxZi(xi) 

i — l 

where each Zi is a surjective mapping from Xi to [0, 1] and hence, also induces an marginal 
preorder <i on Xi. Each P_ t can be approximated by a p-box (F_i,Fi) on (Xi, <,), defined by 

F l (z)=P l (Z-\[0,zl)) F l (z)=P l (Z-\[0,z])) 

This approximation is the best possible one, by Theorem [5J 

Beware that even though different choices of Zi may induce the same total preorder they 
might lead to a different total preorder < induced by Z. Whence, our joint total preorder < is 
not uniquely determined by <j. Roughly speaking, the Zi specify how the marginal preorders 
<i scale relative to one another. As we shall see, this effectively means that our choice of Zi 
affects the precision of our inferences: a good choice will ensure that any event of interest can 
be well approximated by elements of SI/ ~. Of course, nothing prevents us, at least in theory, 
to consider the set of all Zi which induce some given marginal total preorders <j, and whence 
to work with a set of p-boxes. In some cases, this may result in quite complicated calculations. 
However, in Section [7.41 we will see an example where this approach is feasible. 

Anyway, with this choice of Z ' , we can easily find the p-box which represents the joint as 
accurately as possible, under any rule of combination of coherent lower previsions: 
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Lemma 22. Consider any rule of combination Q of coherent lower and upper previsions, map- 
ping the marginals P_ 1; P n to a joint coherent lower prevision OiLi— ? on °^ gambles. 
Suppose there are functions I and u for which: 

0£, (fW =e(P 1 (A l ),...,P n (A n )) and 

QT l \fl A A =u(P 1 (A 1 ),...,P n (A n )), 

for all A\ c X\, . . . , A n c X n . Then, the couple (F_, F) defined by 

F(z) = (.{F^z), . . . ,F n (z)) F(z) = uiF^z), . . . ,F n (z)) 

is the least conservative p-box on (CI, <) whose natural extension E_ F -p is dominated by the 
combination ©™ =1 P4 of P_ x , . . . , P n . 

Proof. By Theorem [3J the least conservative p-box on (CI, <) whose natural extension is domi- 
nated by the joint P_ = ©?=i P.i is given by 

F(z)=P(Z-\[0,z])) F(z)=P(Z-\[0,z])) 

Now, observe that the set Z _1 ([0, z\) is a product of marginal intervals: 

Z _1 ([0, z\) = {(xi, . . . , x n ) e CI: max Zi(xi) ^ z} 

i = l 

= {(»!, . ..,x n ) e CI: (Vi = 1,. . .,n)(Z i (x l ) sS z)} 

n n 

= Y\{xi e X % : Zi(xi) < z) = ]] Z^flp, «]). 
i=l i=l 
The desired equalities follow immediately. □ 

7.2. Natural Extension: The Frechet Case. The natural extension KI™ =1 P i of P x , . . . , P n 
is the lower envelope of all joint distributions (or, linear previsions) whose marginal distributions 
(or, marginal linear previsions) are compatible with the given marginal lower previsions. So, 
the model is completely vacuous about the dependence structure, as it includes all possible 
forms of dependence. We refer to for instance p. 120, §3.1] for a rigorous definition. In this 
paper, we only need the following equalities, which are known as the Frechet bounds [27] (see 
for instance [ISJ p. 131] for a more recent discussion): 

jk\Pi (f\ A-] = max jo, 1 - n + J £i(^i) j and (15a) 
5m ( ff Ai ) = m%aPi(Ai) (15b) 

*=i Vii J 1=1 

for all Ai Q Xi, . . . , A n c X n . 

Theorem 23. T/ie p-box (F,F) defined by 

F(z) =max|o,l-?i + 2^( z )J ^( 

z) = min Fi(z) 

is the least conservative p-box on (CI, <) whose natural extension E_ F -p is dominated by the natural 
extension K" =1 1^ of P 1> . . . , P_ n . 

Proof. Immediate, by Lemma |2"21 and Eqs. (fT5")) . □ 
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The next example shows that, even when P 4 are p-boxes, the joint p-box will in general only 
be an outer approximation (although the closest one that is a p-box) of the joint lower prevision. 

Example 24. Consider two variables X and Y with domain X = {x\,x 2 }, with x\ < x 2 , and 
y = {2/1,2/2}, with yi < y 2 . Consider 

F 1 (x 1 )=0A, F^Zi) =0.6, F 1 (x 2 )=F 1 (x 2 ) = l, 

F 2 ( yi ) = 0.2, F 2 ( yi ) = 0.3, F 2 (y 2 ) = F 2 (y 2 ) = 1. 

Let P_ x be the natural extension of (P^Pi), and let P 2 be the natural extension of (P 2 ,P 2 ). 
Consider the events A = {x\} x y and B = X x {y 2 }. Writing P for B^ =1 P j , we have that 

E(A) = E 1 ({x 1 }) = max{0,F 1 (a;i) —Fi(xi—)} = 0.4 
P(B) =P 2 ({y 2 }) = max{0,F 2 (y 2 ) -F 2 ( yi )} = 0.7 

whence, 

P(A uB)= max{P(A),£(S)} = 0.7, 

P(A nB) = max{0, 1 - 2 + P(A) + P(B)} = 0.1. 

But this means that P is not even 2-monotone, because P(A u B) + P_(A n B) < P_{A) + 
P(B). Therefore, P cannot be represented by a p-box, as p-boxes are completely monotone by 
Theorem [T71 

7.3. Independent Natural Extension. In contrast, the independent natural extension ®" =1 Pj 
of P-l, . . . , P„ models epistemic independence between X\, . . . , X n . We refer to [7] for a rigorous 
definition and properties. In this paper, we only need the following equalities: 

n / n \ n 

(g) p, n a < = n s ) and ( i6a ) 

i = l \i = l / i=l 
n / n \ n 

(g)Pi =n^(^) ( ieb ) 

i = l \i = l / i = l 

for all Ax Q Xi, . . . , An c X n . 

Theorem 25. The p-box (P, P) defined by 

n n 

F(z) = Y\F l (z) P(z) = [piM 

i=l i=l 

is t/ie least conservative p-box on (f2, <) whose natural extension E_ F ~p is dominated by the 
indepedent natural extension ®"_ 1 P i of P 1} . . . , P n . 



Proof. Immediate, by Lemma [221 and Eqs. ([16]) . □ 

Again, in general, the joint p-box will only be an outer approximation of the actual joint lower 
prevision. 

Example 26. Again, consider two variables X and Y with domain X = {xi,x 2 }, with x\ < x 2 , 
and y = {2/1,2/2}, with y\ <y 2 . Consider 

P 1 (xi)=0.4, P 1 (a; 1 ) = 0.6, F x {x 2 ) = Pi(z 2 ) = 1, 

F 2 {y x ) = 0.3, Fa(tfi) = 0.5, P 2 (?y 2 ) = P 2 (jy 2 ) = 1. 
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As before, let P_ x be the natural extension of (F_i,Fi), and let £ 2 be the natural extension of 
(£2,^2)- Consider the events A = {(x\,y\), (xi, 2/2)} and B = {(2:1,2/2), (£2,2/1)}- Writing £ for 
®™ = i£i> we have that 

P(A)=P 1 ({x 1 })=0A 
P{B) ^ 0.4 

where the last inequality follows from the fact that all probability mass functions p which dom- 
inate £ must satisfy p(xi\y 2 ) $' P({xi}) = 0.4 and p(x2\yi) i' £({£2}) = 0.4, whence 

p(B) = p(x 1 \y 2 )p(y 2 ) + p(x 2 \yi)p(yi) Ss 0A(p( yi ) +p(y 2 )) = 0.4 

for all p which dominate P. Because £ is the lower envelope of all such p, the desired inequality 
follows^ Also, because of the factorization property of the independent natural extension, 

P(A uB) = 1 - P({(x 2 ,y 2 )}) = 1 - Pi({.T 2 })P 2 ({?i2}) = 1 - 0.6 x 0.7 = 0.58 
P(A nB)=P({(x 1 ,y 2 )}) = P 1 ({x 1 })P 2 ({y 2 }) = OA x 0.5 = 0.2. 

Again, this means that £ cannot be represented by a p-box, as it violates 2-monotonicity. 

7.4. Special Case: Probabilistic Arithmetic. Let Y = Xi + X2 with X\ and X2 real- 
valued random variables. One can also consider substraction, multiplication, and division, but 
for simplicity, we stick to addition — the other three cases follow along almost identical lines. 

Probabilistic arithmetic [46] deals with the problem of estimating Py([— 00, yj) = F_ Y (y) and 
Py ([— 00, y]) = Fy(y) for any i/el under the assumptions that 

• the uncertainty on X\ and X2 is given by p-boxes {F_\,F\) and (£2,-^2), with <i and 
<2 the natural ordering of real numbers|j and 

• the dependence structure is completely unknown (Frechet case). 

Using the Frechet bounds, Williamson and Downs [45 provide explicit formulae for the different 
arithmetic operations, thus providing very efficient algorithms to make inferences from marginal 
p-boxes. 

Let us show, for the particular case of addition, that their results are captured by our joint 
p-box proposed in Theorem [23l Other cases, not treated here to save space, follow from almost 
identical reasoning The lower cumulative distribution function resulting from probabilistic 
arithmetic is, for any y e R 

F Xi+X2 (y)= sup max{0,£ 1 ( a; i)+F 2 ( a;2 )-l}. (17) 

X1,X2 : Xl+X2=3/ 

Without much loss of generality, and for our convenience, assume that both X\ and A" 2 lie in a 
bounded interval [a, b]. 

Let Z\ and Z2 be any surjective maps [a, b] —* [0, 1] which induce the usual ordering on [0, 1]. 
Some properties of Z\ and Z2 immediately follow: both are continuous and strictly increasing, 
and so are their inverses — we rely on this in a bit. 

To apply Theorem 1231 we consider the total preorder < on Q = [a, b] 2 induced by Z(x\, x 2 ) = 
max{Zi(xi), ^2(2:2)}. Figure [3] illustrates the eventTi {X\ + X2 sS y], with y e [2a, 2b], as well 
as the largest interval Z _1 ([0, z\) included in it. Recall that 

Z-H[0,z]) = Z^{[0,z\) x Z 2 \[0,z]) = [0,Z^(z)] x [0,Z 2 \z)]. 



'By linear programming, it can actually be shown that P(B) — 0.4. 
'For substraction and division, <2 is the reverse natural ordering. 

^Note that Xi and X2 are assumed to be positive in case of multiplication and division. 
1 By {Xi + X2 =S y) we mean {(2:1,0:2) £ [0, l] 2 : x\ + X2 ^ y}. 
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Z 2 \z) 



L 

□ {X! + X 2 ^y} 



FIGURE 3. The event {Xi + X 2 ^ y}, and the largest interval Z _1 ([0, z\) in- 
cluded in it. 



Whence, for z such that Z^{z) + Z 2 ' 1 (z) = y, we achieve the largest interval Z _1 ([0, z]) which 
is still included in {Xi + X 2 ^ y}- There is always a unique such z because also Z^ 1 + Z^ 1 is 
continuous and strictly increasing. 

Recall that, by Theorem 1231 (now without shortcuts in notation), 

F{Z-\z)) = maxjO,^^- 1 ^)) + F 2 {Z^(z)) - 1} 



F(Z^{z)) 



min{F x {Z^{z)),F 2 {Z 2 \z))} 

2 — 1 



is the least conservative p-box on (f2, <) whose natural extension is dominated by the natural 
extension P 2 Kl P_ 2 of P 1 and P_ 2 . 

Also, as we have just shown, Z _1 ([0, z]), for our choice of z, is the topological interior of 
{Xi + X 2 ^ y). Whence, by TheoremCS! we find that 

E^dXt +X 2 ^ y}) = E E f(Z-\[0, z])) = F(Z~\z)) 

= maxlO^Zr 1 ^)) +F 2 (Z 2 - 1 (z)) - 1} 

where we remember that z is chosen such that Z± (z) + Z 2 ~ 1 (z) = y. 
But, this holds for every valid choice of functions Z\ and Z 2 , whence 

P 1 MP 2 ({X 1 +X 2 ^y})^ sup mflx{0,£ 1 (a;i) + F 2 (x 2 ) - 1} 

which indeed coincides with Eq. (|17p. Similar arguments hold for the upper cumulative distri- 
bution functions, and other arithmetic operations. 

In conclusion, probabilistic arithmetic constitutes a very specific case of our approach. 

8. Examples 

In this section, we investigate two different examples in which p-boxes are used to model 
uncertainty around some parameters. The first example concerns a damped harmonic oscillator, 
i.e., a classical engineering toy example. The second example concerns the evaluation of a river 
dike height, an important issue in regions subject to potential floods. 

8.1. Damped Harmonic Oscillator. Consider a simple damped harmonic oscillator, with 
damping coefficient c > 0, spring constant k > 0, and mass m > 0. The damping ratio 
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Rectangular equivalence class. 



Elliptical equivalence class. 



Figure 4. Different possible equivalence classes. 



determines how quickly the oscillator returns to its equilibrium state — £(c, k) = 1 means fastest 
convergence. Suppose the engineering design has already been completed, so the optimal values 
for c* and k* have been determined, such that C( c * ; k*) = 1. 

Without loss of generality, we choose the units for mass, time, and length, such that m = 
k* = C(c*,fc*) = 1 (so c* = 2). 

Of course, the actual values for c and k will differ from their design values, and uncertainty 
must be taken into account. Let us calculate the lower and upper expectation of £(c, fc), given 
that our uncertainty about c and k is described by a p-box. 

First, we must specify a preorder. For this problem, it seems fairly natural to have bounds on 
the quantiles of the distance between the actual values (c, k) and the design values (2, 1). This 
comes down to for instance the following choice for Z: 



For simplicity, we only consider the region Z(c, k) ^ 1. This means that we are certain that 
c e [1,3] and k e [0.5,1.5] — if necessary, Z can be rescaled to accomodate larger or smaller 
regions. Note that we have taken a supremum norm as distance. This simplifies the calculations 
below, but of course, one might as well take the Euclidian norm, or any other reasonable distance 
function, at the expense of slightly more complicated calculations and dependency modelling (see 
Fig.©. 

Equivalence classes [(c, fc)]~ are edges of rectangles with vertices 



What is a p-box for the preorder < induced by Zl A p-box (F_, F) specifies lower and upper 
bounds for the probability of concentric rectangles around the design point (2, 1): 



Z(c,k) =max{|c-2|,2|fc-l|} 



(2 + Z(c,k),l +Z(c,k)/2). 



F(z)^p({(c,k): Z(c,k)^z})^F(z) 



So, effectively, our p-box specifies concentric prediction regions for the uncertain parameters c 
and k. 
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FIGURE 5. The lower oscillation osc(Q(z) and upper oscillation osc(Q(z) 

z{t) 




Figure 6. The function z(t) which determines the cut sets. 

We can now calculate the lower and upper expectation of £(c, k). First, we calculate the lower 
oscillation osc(Q and upper oscillation osc(Q ( see Fig. [5]): 

2 — z 

osc (C)(z) = inf C(c,k) = , = 

J J (c,k): Z(c,k)=z^ K ' 2y/(l+ Z /2) 

2 + z 

osc(Q(z) = sup C(c,k) = — - 

(c,k): Z(c,k) = z 2 y (1 — 2/2) 

Next, we find the full components of the events 

U = {z e [0, 1] : osc(C)(z) > t} = \ z e [0, 1] : 



2-z 



Ui 



6 [0,1]: 555(0(2!) 



6 [0,1]: 



2V(! + V2) 
2 + z 



3s t 



^ t 



for all i e [0, 0.5]. Fortunately, osc (C) is decreasing as function of z, and osc(£) is increasing, and 
hence L t = [0,£t] and U t = [u t , 1], with (see Fig. EJ) 

2 - i(-t + V* 2 + 8) := z(t) withieM=,l] 

with i e [1, -^] 



-2 + t(-t + ^t 2 + 8) := 



Note that the given bounds for t arise from the minimum and maximum of osc ( f) and osc(f). 
Concluding, by Proposition [2TJ when F(z) = F(z—) for all z e [0, 1] and F(0) = 0, 



- (0 = 76 + ll Z(Z(t))di 
£(0 = 1 + ^ (!-£(-*(*))) 



dt 
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symbol name unit 

h overflow height of the river m 

q river flow rate m 3 s _1 

b river width m 

k Strickler coefficient to 1 / 3 s _1 

u upriver water level m 

d downriver water level m 

£ length of river stretch m 



Table 1 . Meaning of the variables used in Eq. (fTS)) 



Interestingly, both the lower and upper expectation of £ are determined by the lower cumu- 
lative distribution function only. Hence, in this problem, we actually do not need to elicit the 
upper cumulative distribution functionl 12 ! 

For example, if the expert says that c and k are independent, and the marginal lower cumu- 
lative distribution functions are uniform once [1 , 2] and k e [0.5, 1.5], so F_i( z ) = Es( z ) = z i 
with preorders induced by Zi(c) = |c — 2| and Z%(k) = 2\k — 1|, then, by Theorem l25l because 
Z = max{Zi, Z2}, it follows that F_{z) = z 2 , and 

mo 

E(0 



V6 



z(tfdt = 0.584 



7e 

f 3/V2 

1 + J (1 - z{tf) dt = 1.664 



8.2. River Dike Height Estimation. We aim to estimate the minimal required dike height 
along a given stretch of river, using a simplified model that is used by the EDF (the French 
integrated energy operator) to make initial evaluations [13] . Although this model is quite simple, 
it provides a realistic industrial application. Skipping technical details, the model results in the 
following relationship: 

h(q,k,u,d)= J 0^7%^) i£q>0 (18) 
I otherwise. 

The meaning of the variables is summarised in Table [TJ 

For the particular case under study, the river width is b = 300m and the river length is 
£ = 6400m. The remaining parameters are uncertain. Expert assessment leads to the following 
distributions: 

• The maximal flow rate q has a Gumbel distribution^" 3 ] with location parameter \x = 
1335m 3 s _1 and scale parameter f3 = 716m 3 s _1 . For calculations, it is easier to work 
with symmetric distributions. Therefore, we introduce a variable r satisfying 

q = fi — P ln(— ln(r)). 



Of course this will not always be the case — it just happens to be so for this example. 
13 The Gumbel distribution models the maximum of an exponentially distributed sample, and is used in extreme 
value theory 1 1 21 to model rare events such as floods. 
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p(k) 




45 k 



15 30 



□ p(fce [30-15z,30 + 15z]) 



Figure 7. Derivation of the p-box for a triangular distribution. 

If r is uniformly distributed over [0, 1], then q has the Gumbel distribution with location 
parameter fi and scale parameter /3. So, after transformation, 



• The Strickler coefficient k has a symmetric triangular distribution over the interval 
[15m 1 / 3 s~ 1 , 45m 1 / 3 s~ 1 ] (with mode at k* = 30m 1 / 3 s~ 1 ). 

• There is also uncertainty about the water levels u and d, because sedimentary conditions 
are hard to characterise. Measured values are u* = 55m and d* = 50m, with measure- 
ment error definitely less than lm. These are also modelled by symmetric triangular 
distributions, on [54to, 56m] and [49m, 51m] respectively. 

Again, a natural choice for Z is the distance between the expected values (r* = 1/2, k* = 
30, u* = 55, d* = 50) and the actual values (r,k,u,d): 



The scale of the distances has been chosen such that Z(r,k,u,d) ^ 1 for all points of interest. 
Equivalence classes [(r, k, u, d)]~ are borders of 4-dimentional boxes with vertices 



The lower oscillation osc (h) and upper oscillation osc(h) can be calculated along the same 
lines as in the previous example: 




Z(r, k, u, d) = max{2|r - 1/2|, \k - 30|/15, \u - 55|, \d - 50|}, 



((1 ± z)/2, 30 ± 15z, 55 ± z, 50 ± z) 

where z = Z(r, k, u, d). 

The marginal p-boxes are, for r: 

F x {z) = Fi(z) = P (2\r - 1/2] ^ z) = p(r e [(1 - z)/2, (1 + z)/2\) = z 

because r is uniformly distributed over [0, 1]. For k, we have: 

F 2 (z) = F 2 (z) = p(\k - 30|/15 ^ z) = p(k e [30 - 15*, 30 + 152]) = 1 - (1 - zf 

(see Fig. [7]). Similarly, for u and d, it is easily verified that: 

F 3 (z) = F 3 (z) = F 4 (z) = F 4 (z) = 1 - (1 - zf 



osc(h)(z) 



inf 

(r,k,u,d) : Z(r,k : u 7 d)=z 



h(r, k, u, d) 



osc(h)(z) 



sup 

(r,k,u 7 d) : Z(r,k : u 7 d)=z 



h{r, k, u, d) 



o(z) 
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Figure 8. The function o(z) which determines the lower and upper oscillation, 
and the cut sets. 



with 

o(z) 



3 

H-gt_M^liy if M -/31n(-ln((l + 4/2)^0 



otherwise. 



The function o{z) is depicted in Fig. [5] it is increasing, with o(— 1) = (this is not immediately 
clear from the picture, but at higher scale, it becomes apparent), o(0) = 3.032, and o(l) = +oo. 

Again, osc (h)(z) and osc(h)(z) are decreasing and increasing in z, respectively. Hence the full 
components of the events 

L t = {z e [0, 1] : osc(h)(z) ^ t} = {z e [0, 1] : o{-z) t] 
U t = {ze [0, 1] : osc(h)(z) ^ t} = {z e [0, 1] : o(z) 3s i} 

are of the form L t = [0,£t] and Ut = [ut, 1] again, with 

it = -o _1 (t) for f < o(0) u t = cr 1 ^) for i > o(0) 

As in the previous example, we do not need to elicit the upper cumulative distributions, and 
only the lower ones need to be given. With unknown dependence, using Theorem 1231 we have 

F(z) = max{0, -3 + z + 3(1 - (1 - z) 2 )} 

and whence 

ro(0) 

K{h) = Z(-o _1 (0)di = 1-515 
Jo 

_ r +rx ' 

E(h) = o(0) + (1 - Fio- 1 ^))) dt = 6.423 

Jo(0) 

Therefore, to be on the safe side, we should consider average overflowing heights of at least 6.5m. 
For comparison, using traditional methods instead of p-boxes, h has expectation 3.2m, assuming 
independence between all variables — this lies between the lower and upper expectation that we 
just calculated, as expected. The interval is obviously much wider: 

• because we have reduced a multivariate problem to a univariate one, whence, leading to 
imprecision due to the difference between lower and upper oscillation, 

• and because we have not made any assumption of independence, whence, leading to 
imprecision due to weaker assumptions. 

Realistically, the decision maker may desire a dike height t such that the upper probability of 
disaster E({h ^ t}) is less than a given threshold. It is easily verified that: 

E({h ^ <}) = E({osc(h) t}) = 1 -Z(o _1 (t)) 
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For instance, for E({h ^ t}) = 0.01, we need t to be 10.725m. For comparison, using traditional 
methods instead of p-boxes, t needs to be about 9m, assuming independence between all variables. 

In both examples, analytical calculations are relatively simple due to the monotonicity of the 
target function with respect to the uncertain variables. Of course, this may not be the case in 
general. 



9. Conclusions 

P-boxes are one of the most interesting imprecise probability models from an operational point 
of view, because they are simply characterised by a lower and an upper cumulative distribution 
function. In this paper, for the purpose of multivariate modelling, we studied inferences (lower 
and upper expectations in particular) from p-boxes on arbitrary totally preordered spaces. For 
this purpose, we represented p-boxes as coherent lower previsions, and studied their natural 
extension. 

We used an as general as possible model by considering p-boxes whose lower and upper 
cumulative distribution functions are defined on a totally preordered space. Thereby, we extended 
the theory of p-boxes from finite to infinite sets, and from total orders to total preorders. This 
allowed us unify p-boxes on finite spaces and on intervals of reals numbers, and to extend the 
theory to the multivariate case. 

One very interesting result of this paper is a practical means of calculating the natural ex- 
tension of a p-box in this general setting. We proved that the natural extension of a p-box 
is arbitrarily additive on full components of clopen sets with respect to the partition topology 
induced by equivalence classes of the underlying preorder (Theorem [T3l Corollaries [TBI and [T6|) . 
We also proved that the natural extension is completely monotone, and therefore has a Cho- 
quet integral representation (Theorem ITSl) . Consequently, to calculate the natural extension, we 
proved that it suffices to calculate the full components of the cut sets of the lower oscillation, 
followed by a simple Riemann integral (Proposition IT91 . 

As a special case, we studied p-boxes whose preorders are induced by a real-valued mapping. 
Such p-boxes are particularly attractive, as they allow to build or elicit a multivariate uncertainty 
model at once. They correspond to lower and upper probabilistic bounds given over nested 
regions that can take arbitrary shapes. 

Consequently, we provided a new tool to combine marginal p-boxes into a joint p-box, under ar- 
bitrary rules of combination, thereby allowing any type of dependency modelling (Lemma l22D . As 
examples, we considered two extreme cases: assuming nothing about dependence (Theorem [23)) . 
and assuming epistemic independence (Theorem I25[) . Similar formulas are easily derived for 
any other rule of combination. Moreover, Williamson and Downs's [45 probabilistic arithmetic 
obtains as a special case of our approach. 

We demonstrated our methodology on inference about a damped harmonic oscillator, and on 
a river dike assessment, showing that calculations are generally straightforward. 

Of course, many open problems regarding p-boxes remain to be investigated. For instance, 
even though there need not be any relation between the preorder and the dependency model — 
because one can, in theory at least, always construct a multivariate p-box from marginals for 
any dependency model and any preorder — some combinations obviously lead to more imprecision 
than others. Our choice led to simple mathematical expressions, but is perhaps not the best one 
possible in terms of precision. Can the dependency model inform the choice of preorder, to 
arrive at tighter bounds? Also, the connection of p-boxes with other uncertainty models, such 
as possibility measures and clouds, deserves further investigation. 
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